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SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Tsien et al . , Roger Y. et al . 

7 (ii) TITLE OF INVENTION: LONG WAVELENGTH MUTANT FLUORESCENT 

8 PROTEINS 
10 (iii) NUMBER OF SEQUENCES: 4 

12 (iv) CORRESPONDENCE ADDRESS: 

13 (A) ADDRESSEE: Fish & Richardson P.C. 

14 (B) STREET: 4225 Executive Square, Suite 1400 

15 (C) CITY: La Jolla 

16 (D) STATE: CA 

17 (E) COUNTRY: USA 

18 (F) ZIP: 92037 
2 0 (v) COMPUTER READABLE FORM: 

21 (A) MEDIUM TYPE: Floppy disk 

22 (B) COMPUTER: IBM PC compatible 
2 3 - (C) OPERATING SYSTEM: PC-DOS/MS -DOS 
24 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 
2 6 (vi) CURRENT APPLICATION DATA: 

C--> 27 (A) APPLICATION NUMBER: US/10/071,976 

C--> 28 (B) FILING DATE: 05-Feb-2002 

33 (C) CLASSIFICATION: 

34 (vii) PRIOR APPLICATION DATA: 

31 (A) APPLICATION NUMBER: 09/465,142 

32 (B) FILING DATE: 1999-DEC-16 

35 (A) APPLICATION NUMBER: 08/974,737 

36 (B) FILING DATE: 

38 (viii) ATTORNEY/AGENT INFORMATION: 

39 (A) NAME: Haile, Lisa A. 

40 (B) REGISTRATION NUMBER: 38,347 

41 (C) REFERENCE/DOCKET NUMBER: 07257/056001 
4 3 (ix) TELECOMMUNICATION INFORMATION: 

44 (A) TELEPHONE: 619/678-5070 

45 (B) TELEFAX: 619/678-5099 
4 8 (2) INFORMATION FOR SEQ ID NO : 1: 

50 (i) SEQUENCE CHARACTERISTICS: 

51 (A) LENGTH: 716 base pairs 

52 (B) TYPE: nucleic acid 

53 (C) STRANDEDNESS : single 

54 (D) TOPOLOGY: linear 

58 (ix) FEATURE: 

59 (A) NAME/KEY: CDS 

60 ' (B) LOCATION: 1..714 



ENTERED 
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6 3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

65 ATG AGT AAA GGA GAA* GAA CTT TTC ACT GCA GTT GTC CCA ATT CTT GTT 4 8 

66 Met Ser Lys Gly Glu Glu Leu Phe Thr Ala Val Val Pro He Leu Val 

67 1 5 10 15 

6 9 GAA TTA GAT GGT GAT GTT AAT GGG CAC AAA TTT TCT GTC AGT GGA GAG 96 

70 Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 

71 20 25 30 

73 GGT GAA GGT GAT GTA ACA TAC GGA AAA CTT ACC CTT AAA TTT ATT TGC 144 

74 Gly Glu Gly Asp Val Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 

75 35 40 45 

77 ACT ACT GGA AAA CTA CCT GTT CCA TGG CCA ACA CTT GTC ACT ACT TTC 192 

78 Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 

79 50 55 60 

81 TCT TAT GGT GTT CAA TGC TTT TCA AGA TAC CCA GAT CAT ATG AAA CGG 24 0 

82 Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg 

83 65 70 75 80 

87 CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG CAA AGA 288 

88 His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Gin Arg 

89 85 90 95 

91 ACT ATA TTT TTC AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC 336 

92 Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 

93 100 105 110 

9 5 AAG TTT GAA GGT GAT ACC CTT GTT AAT AGA ATC GAG TTA AAA GGT ATT 384 

96 Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 

97 115 120 125 

99 GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAT AAA TTG GAA TAC AAC 4 32 

100 Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 

101 130 135 140 

103 TAT AAC TCA CAC AAT GTA TAC ATC ATG GCA GAC AAA CAA AAG AAT GGA 4 80 

104 Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 

105 145 150 155 160 

107 ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT GAA GAT GGA AGC GTT 52 8 

108 He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 

109 165 170 175 

111 CAA CTA GCA GAC TAT TAT CAA CAA AAT ACT CCA ATT CTC GAT GGC CCT 576 

112 Gin Leu Ala Asp Tyr Tyr Gin Gin Asn Thr Pro He Leu Asp Gly Pro 

113 180 185 190 

115 GTC CTT TTA CCA GAC AAC CAT TAC CTG TCC ACA CAA TCT GCC CTT TCG 624 

116 Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 

117 195 200 205 

119 AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTT CTT GAG TTT GTA 6 72 

120 Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 

121 210 215 220 

12 3 ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA 714 

124 Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 

125 225 230 235 

127 TA 716 
130 (2) INFORMATION FOR SEQ ID NO : 2: 
132 (i) SEQUENCE CHARACTERISTICS: 
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133 






(A) LENGTH: 2 38 amino 


acids 














134 






(B) TYPE: 


amino acid 


















135 






(D) TOPOLOGY: 


linear 


















137 




(ii) 


MOLECULE TYPE: 


protein 


















139 




(xi) 


SEQUENCE DESCRIPTION: ! 


SEQ ! 


ID NO: 2 












141 


Met 


Ser 


Lys 


Glv 


Glu 


Glu 


Leu 


Phe 


Thr 


Ala 


Val 


Val 


Pro 


He 


Leu 


Val 


142 


1 








5 










10 










15 




144 


Glu 


Leu 


Asp 


Glv 


Asp 


Val 


Asn 


Glv 


His 


Lys 


Phe 


Ser 


Val 


Ser 


Glv 


Glu 


145 








20 










25 










30 






14 7 


Gly Glu 


Glv 


Asp 


Val 


Thr 


Tvt 


Glv 


Lys 


Leu 


Thr 


Leu 


Lys 


Phe 


He 


Cys 


148 
















40 










45 








150 


± nr 


Tnr 


Glv 


Lys 


Leu 


Pro 


Val 


Pro 


Tr*D 


Pro 


Thr 


Leu 


Val 


Thr 


Thr 


Phe 


151 




RO 




















60 










_L _> -> 


Seir 


Tyr 


Gly 


Val 


Gin 


Cy s 


Phe 


Ser 


Arcr 


Tvt 


Pro Asp 


His 


Met 


Lys 




154 


D D 










70 










75 










80 


1 ^fi 

1 JO 


u-i c 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu 


Gly 


Tyr 


Val 


Gin 


Gin 


Arcr 


157 










85 










90 










95 




1 

A. -J Z> 


Til2T 


He 


Phe 


Phe 


Lys 


Asp 


Asp 


Glv 


As n 


Tvr 


Lys 


Thr 


Arg 


Ala 


Glu 


Val 


160 








1 00 










105 










110 






162 


Lys 


Phe 


Glu 


Gly 


A on 
nop 




Leu 


Val 


As n 


Arg 


He 


Glu 


Leu 


Lys 


Glv 


He 


163 






115 










120 










125 








165 


nop 


Phe 


Lys 


Glu 


Asp 


Gly 


Asn 


He 


Leu 


Gly 


His 


Lys 


Leu 


Glu 


Tvr 

JL 


Asn 


166 




130 










135 










140 










168 


iyr 


Asn 


Ser 


His 


Asn 


Val 


Tyr 


He 


Met 


Ala 


Asp 


Lys 


Gin 


Lys 


Asn 


Glv 


1 69 

J- U Z> 


1 A R 










150 










155 










160 


171 


J. J. £2 


Lys 


Val 


Asn 


Phe 


Lys 


He 


Arg 


His 


Asn 


He Glu Asp Gly 


Ser 


Val 


172 










165 










170 










175 




174 


Gin 


Leu 


Ala 


Asp 


Tyr 


Tyr 


Gin 


Gin 


Asn 


Thr 


Pro 


He 


Leu 


Asp 


Glv 


Pro 


175 








180 










185 










190 






1 77 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


Ser 


Thr 


Gin 


Ser 


Ala 


Leu 


Ser 


1 7fl 






195 










200 










205 








180 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys 


Arg 


Asp 


His 


Met 


Val 


Leu 


Leu 


Glu 


Phe 


Val 


1 fil 




210 










215 










220 










183 


Thr 


Ala 


Ala 


Gly 


lie 


Thr 


His 


Gly 


Met 


Asp 


Glu 


Leu 


Tyr 


Lys 






-L O *± 


225 










230 










235 












186 


(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 3: 
















188 




(i) 


SEQUENCE CHARACTERISTICS: 
















_L O -? 






(A) LENGTH: 72 0 base pairs 














190 






(B) TYPE: 


nucleic 


acid 
















191 






(C) STRANDEDNESS: 


single 
















192 






(D) TOPOLOGY: 


linear 


















196 




(ix) 


FEATURE : 
























197 






(A) NAME/KEY: 


CDS 




















198 






(B) LOCATION: 


1 . .720 


















201 




(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO : 3: 












203 


ATG 


GTG 


AGC 


AAG 


GGC 


GAG 


GAG 


CTG 


TTC 


ACC 


GGG 


GTG 


GTG 


CCC 


ATC 


CTG 


204 


Met 


Val 


Ser 


Lys 


Gly Glu 


Glu 


Leu 


Phe 


Thr 


Gly 


Val 


Val 


Pro 


He 


Leu 


W--> 205 




240 










245 










250 










207 


GTC 


GAG 


CTG 


GAC 


GGC 


GAC 


GTA 


AAC 


GGC 


CAC 


AAG 


TTC 


AGC 


GTG 


TCC 


GGC 
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208 


Val 


Glu 


Leu 


A^sp 


Gly 


Asp 


Val 


Asn 


Glv 


His 


Lys 


Phe 


Ser 


Val 


Ser 


Gly 




W- 


_ s 


Z. KJ J 


9 5 5 

Z J J 










260 










265 










270 








211 


GAG 


GGC 


GAG 


GGC 


GAT 


GCC 


ACC 


TAC 


GGC 


AAG 


CTG 


ACC 


CTG 


AAG 


TTC 


ATC 


144 






Z 1 Z 


c i n 


p 1 v 


Cl ii 


C"| v 


Asp 


Ala 


Thr 

1 11X 


Tvr 
lyx 


Cl v 


T.VC! 


lie u 


Thr 


Leu 




Phe 


Tl e 

11C 




W- 


- > 


213 










275 










280 










285 










215 


TGC 


ACC 


ACC 


GGC 


AAG 


CTG 


CCC 


GTG 


CCC 


TGG 


CCC 


ACC 


CTC 


GTG 


ACC 


ACC 


192 






9 1 

Z X D 


c\7 c 


TVi t* 

X 1 1 X 


Thr* 

± 1 IX 


p 1 v 


T.vc 
xiy o 


XlC Ll 


SZ X \J 


Va 1 
v a x 


Pro 


ir F 


t i w 


Thr 


Leu 


Val 
val 


TVi t 

X 1 IX 


TVt t* 
i in 




W- 


_ s 


z x / 








9Q0 










9Q 

Z _7 J 










■J \J \J 












91 Q 

Z X _7 


ttc 


ppp 


TAT 


GGC 


GTG 


CAG 


TGC 


TTC 


GCC 


CGC 


TAC 


CCC 


GAC 


CAC 


ATG 


AAG 


240 






99D 

Z Z VJ 


jr nc 


p "I v 


iyr 


pi v" 
VJ X y 


Val 


Cl n 
win 


P\r Q 

V-» y o 


Phe 


Ala 

/Aid 


A TCT 


xyx 


Pro Asp 


Hi 

nio 


Met" 
1 ic i— 


Lys 




W- 
ri 


- > 


991 

Z Z _L 






305 










310 










jlD 














99 "3 

Z Z <J 


CAG 


CAG 


GAC 


TTC 


TTC 


AAG 


TCC 


GCC 


ATG 


CCC 


GAA 




m a P 

iAL 


GTC 


CAG 


GAG 


288 






9 9 A 
z z 


p 1 n 


cl n 

X 11 


Asp 


Ir 1 1 w 


Ph<=» 

It I IC 


ii_y o 


OCX 


Ala 

ir\ 1 a. 


1*1 C l_ 


Prn 


Cl n 


biy 


Tyr 


Va 1 
v ct i 


Cl n 


Cl n 




W- 


_ S 


99 ^ 

Z Z J 




•Jon 
o z u 










J Z J 










OOK) 
















9 97 

Z Z / 


ccc 




ATP 


mmp 


mmp 


A AC 


CAC 


GAC 


CCC 


AAC 


TAC 




A PP 

ALL 


CGC 


GCC 


GAG 


336 






9 9ft 
z z O 


Arg 


i nr 


Tl 

11c 


Oho 

trie 


lr lie 


Lys 


Asp 


Asp 


Cl \7 


Asn 


iyr 


Lys 


my. f~ 

j. ni 


Arg 


Al a 

r\ X a. 


Cl 1 1 
VjI Ll 




W- 


_ s 


99Q 

Z Z 27 


1 ^ 5 
j j j 










340 










345 










350 








9^1 


\J 1 VJ 


A AC 


rpmp 


PAP 


CCC 


CAC 


ACC 


CTC 


CTC 


AAC 


CCC 


AIL 


PAP 

LAL 


CTC 


AAC 


GGC 


384 






9^9 
z J z 


Val 


Lys 


rflc 


Pi ll 
vlU 


Pi xr 


Asp 


Thr 


Leu 


V^ 1 
v a. X 


Asn 


Arg 


T 1 i= 
11c 


Pi ii 
ulU 


Leu 




Cl v 

L7l jr 




w - 


_ «v 


Z J J 










o c c 
J J J 










JUu 










J U J 










9 1 R 
Z J _> 


Amp 


CAC 


rpmp 
1 1 L^ 


A AC 


CAC 


CAC 


CCC 


AAC 


ATC 


CTC 


CCC 


LAC 


A A P 

AAb 


CTC 


GAG 


TAC 


4 32 






9 "3 

Z J D 


lie 


Asp 


rflc 


Lys 


Asp 


Asp 


Pl \7 

biy 


Asn 


Tl p» 
x x c 


Leu 


Cl \7 


nlS 


Lys 


XlC Ll 


Cl ii 

\J 1 Ll 


iyr 




ri — 


s 


O 7 
Z J / 








O 1 \J 










17 ^ 
J / D 










i ft n 












9 

Z O _7 


r\r\L— 


TAC 


A AC 


ACC 


CAC 


AAC 


CTC 


TAT 


ATC 


ATG 


GCC 


A P 

LAL 


A A P 
AAb 


CAG 


AAG 


AAC 


480 






Z *i U 


As n 


xyr 


As n 


OCX 


nib 


A G n 
nail 


Va 1 

V cl X 


±yr 


Tl <=» 

11C 


ric l. 


Ala 

nld 


Asp 


Lys 


Gin 


Lys 


Asn 




W- 


_ % 


941 






T ft 5 

O O J 










son 










jyi) 














94 ^ 

z t J 


PPP 


ATT" 

nlU 


AAG 


GTG 


AAC 


TTC 


AAG 


ATC 


CGC 


CAC 


AAC 


A TP 
AIL 


PAP 
LAL 


GAC 


GGC 


AGC 


528 






9 Ad 

z *± *t 


cl v 


T 1 <=» 
lie 


j_i_y o 


val 


A c n 
noil 


php 
r jic 


T,vc 


Tl ^ 

X X c 


rAi y 


nx o 


Ion 
ndll 


He 


Glu 


Asp 


Glv 

J; 


Ser 




W- 

Ti 


_ s 


94 5 

Z ft J 




a no 

ft u u 










4 0 S 

■* VJ -J 










410 
















Z f _7 


PTC 


PAP 




CCC 


GAC 


CAC 


TAC 


CAG 


CAG 


AAC 


ACC 


CCC 


ATC 


GGC 


GAC 


GGC 


576 






9 SO 

Z Ju 


Va 1 
val 


Gin 


Pro 


Ala 


Ron 


n x o 


Tyr 


Gin 


Gin 


A cm 
noil 


TVi t 

1 111 


Pro 


He 


Gl v 
oxy 


Asp 


Gly 




w - 


_ «y 


O ^ 1 
Z D 1 


AIR 










420 




















T J W 








253 


CCC 


GTG 


CTG 


CTG 


CCC 


GAC 


AAC 


CAC 


TAC 


CTG 


AGC 


TAC 


CAG 


TCC 


GCC 


CTG 


624 






9 ^4. 


r? -r~ r~> 

IT X LV 


val 


Leu 


Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


Cp-r 
OCI 


Tyr 


Gin 


Ser 


Ala 


Leu 




W- 




9 ^ ^ 
/ j j 










435 










440 










445 










9 S7 

Z J / 


ACC 


AAA 


GAC 


CCC 


AAC 


GAG 


AAG 


CGC 


GAT 


CAC 


ATG 


GTC 


CTG 


CTG 


GAG 


TTC 


672 






9 Rft 
zoo 


C d T~ 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys 


Arg 


Asp 


His 


Met 


Val 


Leu 


Leu 


Glu 


Phe 




W- 


- > 


259 








450 










455 










460 












261 


GTG 


ACC 


GCC 


GCC 


GGG 


ATC 


ACT 


CAC 


GGC 


ATG 


GAC 


GAG 


CTG 


TAC 


AAG 


TAA 


720 






262 


Val 


Thr 


Ala 


Ala 


Gly 


He 


Thr 


His 


Gly 


Met 


Asp 


Glu 


Leu 


Tyr 


Lys 


* 




W- 


-> 


263 






465 










470 










475 














266 


(2) 


INFORMATION 


FOR 


SEQ 


ID NO : 4 : 






















268 




(i) 


SEQUENCE CHARACTERISTICS: 






















269 






(A) LENGTH: 239 amino 


acids 




















270 






(B) TYPE: 


amino acid 
























271 






(D) TOPOLOGY: 


linear 
























273 




(ii) 


MOLECULE TYPE: 


protein 
























275 




(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO : 4: 


















277 


Met 


Val 


Ser 


Lys 


Gly Glu Glu 


Leu 


Phe Thr Gly Val 


Val 


Pro 


He 


Leu 
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278 


1 








5 










10 










15 




9 ftn 

Z O V 


V dl 




Toil 
lie U. 


A on 




no \J 


Val 


Asn 


Gly 


His 


Lys 


Phe 


Ser 


Val 


Ser 


Gl v 
^xy 


281 








20 










25 










30 






9 fl ^ 
zoo 


ri~\ n 

\J X u 






vjx y 




AT ^ 
rix d 


X 1 IX 


Tyr Gly. Lys 


Leu 


Thr 


Leu 


Lys 


Phe 


He 


284 






35 










A 0 










45 








zoo 


v--y o 


T> Vi t* 
± I1X 


X J IX 


vji y 


Xjy o 


XjC Li 


Pro 

IT X \J 


Vdl 


C X \J 


Trp 


Pro 


Thr 


Leu 


Val 


Thr 


Thr 


9 fl7 
z o / 
























60 










Z O 


rile 




iyr 


pi Tr 

j_ y 


V d X 


Vj X [1 


Pirn 

y o 




a 1 ^ 


Arg 


Tyr 


Pro 


Asp 


His 


Met 


Lys 


Z -7 U 


D J 




















75 










80 


9 Q 9 






Asp 


rJlc 




Lys 


Ser 


Hid 




Pro Glu Gly Tyr Val 


VJlll 




9 Q 7 

Z -7 J 










o 










90 










95 




Z -7 3 


Arg 


Tnr 


Tl & 
lltr 






±jy o 


A on 


Asp 


oiy 


Asn 


Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 


z y d 








i no 










105 










110 






0 Q Q 


Vdl 


Ly s 


irne 


Pi 11 
^ J.U 


Vjj. y 


Asp 


Tnr 


Leu 


Val 


Asn 


Arg 


He 


Glu 


Leu 


Lys 




9 Q Q 






1 1 c 

ii j 










120 










125 








i m 

jUI 


Tic* 

lie 


Asp 




Lys 


Asp 


Asp 


\jj-Y 


Asn 


He 


Leu 


Gly 


His 


Lys 


Leu 


Glu 


xyr 


^09 




1 10 

1 










1 -3 R 
X J ~> 










140 












Asn 


Tyr 


Ann 

Asn 


ber 


HIS 


Asn 


vai 


Tyr 


He 


Met 


Ala 


Asp 


Lys 


Gin 


Lys 


Asn 


305 


145 










150 








155 










160 


307 


Gly 


He 


Lys 


Val 


Asn 


Phe 


Lys 


He 


Arg 


His 


Asn 


He 


Glu 


Asp 


Gly 


Ser 


308 










165 










170 










175 




310 


Val 


Gin 


Pro 


Ala 


Asp 


His 


Tyr 


Gin 


Gin 


Asn 


Thr 


Pro 


He 


Gly Asp 


Gly 


311 








180 










185 










190 






313 


Pro 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


Ser 


Tyr 


Gin 


Ser 


Ala 


Leu 


314 






195 








200 










205 








316 


Ser 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys 


Arg 


Asp 


His 


Met 


Val 


Leu 


Leu 


Glu 


Phe 


317 




210 










215 










220 










319 


Val 


Thr 


Ala 


Ala 


Gly 


He 


Thr 


His 


Gly Met Asp Glu 


Leu 


Tyr 


Lys 




320 


225 










230 










235 
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